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The upstream end of the 3' untranslated region (UTR) of the mouse hepatitis virus genome contains two 
essential and overlapping RNA secondary structures, a bulged stem-loop and a pseudoknot, which have been 
proposed to be elements of a molecular switch that is critical for viral RNA synthesis. It has previously been 
shown that a particular six-base insertion in loop 1 of the pseudoknot is extremely deleterious to the virus. We 
have now isolated multiple independent second-site revertants of the loop 1 insertion mutant, and we used 
reverse-genetics methods to confirm the identities of suppressor mutations that could compensate for the 
original insertion. The suppressors were localized to two separate regions of the genome. Members of one class 
of suppressor were mapped to the portions of gene 1 that encode nsp8 and nsp9, thereby providing the first 
evidence for specific interactions between coronavirus replicase gene products and a ds-acting genomic RNA 
element. The second class of suppressor was mapped to the extreme 3' end of the genome, a result which 
pointed to the existence of a direct base-pairing interaction between loop 1 of the pseudoknot and the genomic 
terminus. The latter finding was strongly supported by phylogenetic evidence and by the construction of a 
deletion mutant that reduced the 3' UTR to its minimal essential elements. Taken together, the interactions 
revealed by the two classes of suppressors suggest a model for the initiation of coronavirus negative-strand 
RNA synthesis. 


RNA virus genomes contain cis- acting sequences and struc¬ 
tures that interact with viral and cellular components to initiate 
a variety of essential functions. Such interactions can mediate 
RNA replication and transcription, the selective translation 
of viral transcripts, and the specific packaging of progeny 
genomic RNA (gRNA) into assembling virions. Initiation of 
RNA virus genome replication occurs at the extreme 3' end of 
gRNA in order to produce a strand of complementary polarity. 
For positive-strand RNA viruses, the initiation of negative- 
strand RNA synthesis requires the specific recognition of the 
viral gRNA template and the juxtaposition of replicative pro¬ 
teins and the 3' genomic terminus, cis- acting sequences are 
believed to play pivotal roles in these processes. An intriguing 
example of how cis- acting sequences can coordinate the assem¬ 
bly and targeting of a replication complex to the initiation site 
of negative-strand RNA synthesis has been provided by studies 
of poliovirus replication (4, 23, 57). A crucial initial event in 
this system is the formation of a ribonucleoprotein complex 
around a cloverleaf RNA structure at the 5' end of the polio¬ 
virus genome. This complex consists of the poliovirus protease- 
polymerase precursor 3 CD and the cellular poly(rC) binding 
protein, both of which bind to the 5' cloverleaf (1, 2, 7, 17). In 
order to target poliovirus 3 CD to the initiation site of negative- 
strand RNA synthesis, another cellular protein, poly(A) bind¬ 
ing protein, becomes involved. Poly(A) binding protein is as- 
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sociated with the poly(A) tail of the poliovirus genomic RNA 
(which also serves as a cis- acting replication element) and 
bridges the 5' and 3' genomic termini by binding to the 
poly(rC) binding protein and 3CD (4, 23). Thus, in this exam¬ 
ple, cis- acting elements provide not only the basis for the 
discrimination between viral and cellular RNA but also a plat¬ 
form for the assembly of a negative-strand replication initia¬ 
tion complex. 

Coronaviruses are a family of positive-strand RNA viruses 
possessing a complex, multicomponent RNA synthetic machin¬ 
ery and a sophisticated program of gene expression that is 
characterized by the transcription of a 3'-nested set of sub- 
genomic RNAs (sgRNAs), in addition to gRNA replication 
(41). These features are shared with other members of the 
nidovirus order, which also includes arteriviruses, toroviruses, 
and roniviruses. For these viruses, the synthesis of both gRNA 
and sgRNA begins with the initiation of negative-strand RNA 
at the 3' end of the genome (49). A prerequisite for this event 
is the translation of the replicase gene from the positive-strand 
RNA genome, resulting in two large polyproteins. Extensive 
self-processing by viral proteinases residing within the polypro¬ 
teins eventually generates a number of processing intermedi¬ 
ates and 15 to 16 mature end products, termed nonstructural 
proteins (nsp’s) 1 to 16 (nspl to nspl6) (67). Many nsp’s 
colocalize, together with the structural nucleocapsid (N) pro¬ 
tein, at double-membrane vesicles to form the membrane- 
associated replication complex (8,10,14, 22, 53). Although this 
complex is believed to interact with ds-acting elements in or¬ 
der to replicate and transcribe viral RNAs (9), our knowledge 
about specific interactions that may be involved in the various 
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FIG. 1. Structure of the MHV 3' UTR. The organization of the 31.3-kb MHV genome is shown at the bottom. Above this, at the right, is a 
schematic of the current model of the RNA secondary structure of the entire 301-nt 3' UTR. The 3' UTR comprises an essential bulged stem-loop 
(BSL; nt 234 through 301) (18, 26, 27) and overlapping pseudoknot (PK; nt 185 through 238) (18, 60, 63). Downstream of these structures is a 
nonessential HVR (nt 46 through 156), which contains the highly conserved coronavirus octanucleotide motif (oct), 5'-GGAAGAGC-3' (19, 39). 
The arrangements shown for the segment downstream of the HVR (nt 1 through 67) and for part of the segment between the HVR and the 
pseudoknot (nt 143 through 160) are those proposed by Liu and coworkers (38) but are modified in the work reported herein. At the upper left 
is a detailed view of the upstream end of the 3' UTR, denoting the pseudoknot stems (SI and S2) and loops (LI and L2). Broken lines represent 
alternative base pairings either for the bottom stem segment of the bulged stem-loop or for pseudoknot stem 1. Also indicated is the 6-nt insertion 
(AACAAG) in pseudoknot loop 1 that was constructed in the mutant Alb391 (18). All nucleotide numbering is from the first base at the 3' end 
of the genome, excluding poly(A), and the N-gene stop codon is boxed. 


steps of coronavirus RNA synthesis is extremely limited. RNA- 
binding activity has been ascribed to a number of replicase 
gene products (16, 32, 55, 56, 66) and to the N protein (41). 
However, currently available structural and biochemical data 
do not yet allow the assignment of any of these RNA-binding 
proteins to defined interactions with cis -acting elements. Ad¬ 
ditionally, it is not known whether cellular proteins that have 
been shown to bind to viral RNA, such as PTB (28), hnRNP 
A1 (29), and a complex of mitochondrial aconitase and chap¬ 
erones (45, 46, 64, 65), are critically involved in coronavirus 
RNA synthesis. 

A fairly comprehensive view of cis -acting elements located at 
the 3' end of the coronavirus genome has come from studies of 
mouse hepatitis virus (MHV) and its close relative, bovine 
coronavirus (BCoV) (9, 42). Both MHV and BCoV are mem¬ 
bers of the second of the three phylogenetic groups into which 
coronaviruses are sorted. For MHV, work with viral mutants 


has demonstrated that the 3' genomic cis -acting elements es¬ 
sential for RNA synthesis lie entirely within the 301-nucleotide 
(nt) 3' untranslated region (3' UTR) (13, 18), even though 
earlier studies with defective interfering (DI) RNAs had sug¬ 
gested a potential role for a portion of the adjacent N gene (33, 
36, 40). The current picture of the MHV 3' UTR, shown in Fig. 
1, began with the definition of structures at its upstream end. 
Immediately following the N-gene stop codon is a bulged 
stem-loop (27). Adjacent to this is a classical hairpin-type 
pseudoknot (60), first discovered in BCoV. Exhaustive genetic 
analyses, in addition to chemical and enzymatic probing, have 
confirmed the existence and the essential nature of these two 
structures both in DI RNAs and in the intact viral genome (18, 
26, 27, 60). Most significantly, the bottommost stem of the 
bulged stem-loop and stem 1 of the pseudoknot overlap, and 
thus, the two cannot simultaneously fold to completion. Con¬ 
sequently, we have previously proposed that these alternative 
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structures are components of a molecular switch regulating 
a transition between individual steps of viral RNA synthesis 
(18, 26). 

Overlapping bulged stem-loops and pseudoknots are main¬ 
tained in the upstream 3' UTRs of all group 2 coronaviruses, 
including severe acute respiratory syndrome coronavirus 
(SARS-CoV) (18, 20), human coronavirus HKU1 (61), and the 
recently discovered bat coronaviruses HKU4, HKU5, and 
HKU9 (62). (Although it is stated otherwise in reference 62, 
HKU9 clearly harbors a pseudoknot, running from nucleotide 
28898 through 28947, and two candidate overlapping bulged 
stem-loops can be found upstream of it.) This structural con¬ 
servation, despite considerable primary sequence divergence, 
strongly implies functional conservation. More to the point, it 
has been shown that the entire 3' UTR of either BCoV (27) or 
SARS-CoV (20) can functionally replace the MHV 3' UTR. 
Similarly, multiple group 2 coronaviruses closely related to 
MHV and BCoV are competent to serve as helper viruses for 
the replication of a BCoV DI RNA (63). 

Most of the downstream remainder of the MHV 3' UTR is 
composed of a hypervariable region (HVR) (Fig. 1); the HVR 
is highly divergent in both sequence and structure, even among 
closely related coronaviruses like MHV and BCoV. The struc¬ 
ture of the MHV HVR has been confirmed by enzymatic 
probing (38). Detailed genetic analysis has shown that unlike 
the RNA elements in the upstream portion of the 3' UTR, the 
HVR can be mutated or entirely removed from the viral ge¬ 
nome with little effect on virus replication in tissue culture 
(19). The nonessential nature of the HVR is surprising, since 
the region harbors the octanucleotide 5'-GGAAGAGC-3', 
which is almost universally conserved among coronaviruses 
and is always situated some 70 to 80 nt from the 3' end of the 
genome. Although deletion of the HVR greatly attenuates 
MHV virulence in the mouse host, the HVR’s dispensability 
with respect to in vitro replication greatly simplifies consider¬ 
ation of the basic mechanism of RNA synthesis. 

In our previous mutational analysis of the bulged stem-loop 
and pseudoknot, we observed that loop 1 of the pseudoknot is 
extremely sensitive to insertions of various sizes (18). A thresh¬ 
old of viability was seen to occur at an insertion size of six 
nucleotides: an insertion one base smaller, AACAA, was fully 
tolerated, whereas an insertion one base larger, AACAACA, 
was lethal. One particular 6-nt insertion, AACAAA, had only 
a minimal effect on the viral phenotype. By contrast, a mutant 
designated Alb391, with a 6-nt insertion of AACAAG, was 
extremely unstable. Alb391 formed very small plaques com¬ 
pared to those of the wild type, and it gave rise to large-plaque 
revertants within a single passage. In the work described here, 
we carried out an analysis of multiple revertants of Alb391. 
The results of this study provide the first indication of an 
interaction between a 3' cis -acting element required for viral 
RNA synthesis and components of the coronavirus replicase. 
In addition, our results point to an interaction between stem 1 
of the pseudoknot and the extreme 3' terminus of the MHV 
genome. These two sets of interactions suggest a model for the 
initiation of viral negative-strand RNA synthesis. 

MATERIALS AND METHODS 

Cells and viruses. Wild-type MHV-A59, MHV mutants, and revertants were 
propagated in mouse 17 clone 1 (17C11) cells. MHV plaque assays and plaque 


purifications were carried out in mouse L2 or L929 cells, which were overlaid 
with 1% Noble agar and stained with neutral red at 48 h postinfection. The 
interspecies chimeric virus fMHV.v2 (18) was grown in feline FCWF cells. 
Vaccinia viruses were propagated in CV-1 and BHK-21 cells, and plaque puri¬ 
fications were performed on CV-1 and D980R cells (11). 

Vaccinia virus-based MHV reverse genetics. The recombinant MHV mutants 
rAlb391, rN8-Alb391, rN9-Alb391, rN8, and rN9 were derived from vaccinia 
virus vMHV-inf-1, which contains a cloned full-length MHV-A59 cDNA 
(GenBank accession number AY700211). Mutagenesis was done using the re- 
verse-genetics system described previously (11, 24). Briefly, two rounds of vac¬ 
cinia virus-mediated homologous recombination were performed using the Esch¬ 
erichia coli guanine-phosphoribosyltransferase (GPT) gene as a selection marker. 
First, the target region within the MHV cDNA was replaced by the GPT gene, 
and second, the GPT gene was replaced by a mutated version of the target 
region. To construct recombinant vaccinia viruses encoding the Alb391-specific 
AACAAG insertion in the MHV 3' UTR, we used the plasmid pRec5 for 
recombination with vaccinia virus vMHV-inf-1. The GPT gene in pRec5 is 
flanked on its left by MHV-A59 nt 28510 to 29024 and on its right by 500 bp of 
vaccinia DNA that is encoded downstream of the MHV-A59 3' terminus in 
vMHV-inf-1. The resulting recombinant vaccinia virus, vRec5, was used for 
recombination with the plasmid pRec7 to insert the Alb391-specific 3' terminus. 
The plasmid pRec7 contains MHV Alb391 nt 28510 to 31335, followed by a 
synthetic poly(A) sequence and 500 bp of vaccinia virus DNA. The resulting 
recombinant vaccinia virus clone was designated vMHV-rAlb391. To mutagenize 
nsp8 and nsp9, the region encoding MHV nsp7 to nspll was replaced by GPT, 
using vaccinia virus-mediated homologous recombination of vMHV-inf-1 or 
vMHV-rAlb391 with plasmid pRec3. The plasmid pRec3 encodes the GPT gene 
flanked on its left by MHV-A59 nt 11464 to 11956 and on its right by MHV-A59 
nt 13594 to 14151. The resulting recombinant vaccinia viruses were then recom¬ 
bined with reverse transcription (RT)-PCR-derived DNA fragments obtained 
from RNA isolated from the Alb391 revertant Rev2 or Rev3. The resulting 
recombinant vaccinia viruses thus produced were designated vMHV-rN8 (carry¬ 
ing the Rev2 mutations in nsp8 and the wild-type MHV 3' UTR), vMHV-rN9 
(carrying the Rev3 mutation in nsp9 and the wild-type MHV 3' UTR), vMHV- 
rN8-Alb391 (carrying the Rev2 mutations in nsp8 and the Alb391 3' UTR), and 
vMHV-rN9-Alb391 (carrying the Rev3 mutation in nsp9 and the Alb391 3' 
UTR). The identities of plasmids and recombinant vaccinia viruses were con¬ 
firmed by sequencing analysis of the mutated regions. Further cloning details, 
plasmid maps, and sequences are available from the authors upon request. 

Recombinant coronaviruses were rescued from cloned cDNA using purified, 
Eagl-cleaved vaccinia virus DNA as a template for the in vitro transcription of 
recombinant full-length MHV genomic RNA, which was electroporated into 
BHK-MHV-N cells as described previously (11). Following electroporation, the 
transfected BHK-MHV-N cells were mixed with a fourfold excess of murine 
17C11 cells and cultured at 37°C. At days 1 and 2 postelectroporation, tissue 
culture supernatants were taken for further analysis. 

Targeted RNA recombination. For the construction of the AHVR3 mutant and 
the reconstruction of the 3' UTR A2G mutation, targeted RNA recombination 
with host range-based selection was used, as described in detail previously (13, 
18, 19, 20, 34, 35, 43). In brief, monolayers of feline FCWF cells were infected 
with fMHV.v2 and were then transfected by electroporation with capped, T7 
polymerase-transcribed donor RNAs (mMessage mMachine transcription kit; 
Ambion). Templates for donor RNAs were derived from pSG6 (18) via PCR- 
based mutagenesis methods, which replaced plasmid segments between unique 
restriction sites in the 3' UTR or the downstream polylinker. pSG6 contains a 
cDNA segment of the 5'-most 0.5 kb of the MHV genome linked to the 3'-most 
8.6 kb of the MHV genome. Progeny virus from infected and transfected feline 
cells were selected on the basis of having regained the ability to form plaques on 
murine L2 cell monolayers, and the resulting recombinant candidates were 
purified through two rounds of plaque titration. For mutants constructed by 
targeted RNA recombination, the wild-type virus used for comparisons was 
Alb240, a well-characterized isogenic recombinant that was previously recon¬ 
structed from fMHV and pMH54 donor RNA (35). 

Characterization of mutants and revertants. For analysis of the sequences of 
constructed mutants and isolated revertants, total cellular RNA was extracted 
from infected monolayers and reverse transcribed with avian myeloblastosis virus 
(Life Sciences) or Superscript II (Invitrogen) reverse transcriptase using random 
hexanucleotide (Boehringer Mannheim) or MHV sequence-specific primers. 
Regions of the genome were then amplified by PCR under standard conditions 
using AmpliTaq polymerase (Roche), and PCR products were purified with 
Quantum-prep columns (Bio-Rad) prior to automated sequencing. For viruses 
for which the entire genome was sequenced (Rev2, Rev5, and Rev6), the extreme 
5'-end sequence was obtained through rapid amplification of cDNA ends (573' 
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RACE kit; Roche). In order to resolve the mixed sequence obtained for the 
mutant rAlb391 isolate S3, we cloned RT-PCR products using the pGEM-T Easy 
Vector System (Promega), and individual plasmids were subsequently se¬ 
quenced. 

For the measurement of growth kinetics, confluent monolayers of 17C11 cells 
were inoculated at a multiplicity of 0.01 or 5.0 PFU per cell (for the AHVR3 
mutant) or at a multiplicity of 1.0 PFU per cell (for all other mutants) for 2 h at 
37°C. Inocula were removed, monolayers were washed three times, and incuba¬ 
tion was continued in fresh medium at 37°C. Sample aliquots of medium were 
then withdrawn at various times from 2 to 32 h postinfection, and infectious titers 
were subsequently determined on mouse L2 cells. 

RNA structural predictions were carried out with the mfold software (44, 68), 
available at http://www.bioinfo.rpi.edu/applications/mfold. 

Radiolabeling of viral RNA. Intracellular viral RNA was metabolically labeled 
in 17C11 cells as described previously (18-20, 27). In brief, 20-cm 2 confluent 
monolayers were infected with the AHVR3 mutant or wild-type MHV at a 
multiplicity of 5.0 PFU per cell at 37°C. Infected cells were starved from 2 h 
through 8 h postinfection in Eagle’s minimal essential medium containing 5% 
dialyzed fetal bovine serum with 1/10 of the normal concentration of phosphate. 
Cells were then labeled from 8 h through 10 h postinfection in 1 ml phosphate- 
free medium containing 100 |xCi per ml [ 33 P]orthophosphate (MP Biomedicals), 
5% dialyzed fetal bovine serum, and 20 |xg of actinomycin D (Sigma) per ml. 
Total cytoplasmic RNA was purified, and samples were analyzed by electro¬ 
phoresis through 1% agarose containing formaldehyde and were visualized by 
fluorography. 


RESULTS 

Initial revertant analysis of a 3' UTR pseudoknot mutant. In 

a previous study, we used targeted RNA recombination to 
construct a number of mutants of the MHV 3' UTR bulged 
stem-loop and pseudoknot in order to obtain an understanding 
of the structural requirements of these two elements and their 
interrelationship (18). An additional goal was to generate se¬ 
verely impaired viral mutants that might direct us to genetic 
interactions between the 3' UTR and other regions of the 
genome. A mutant that met the latter criterion was Alb391, in 
which an insertion of 6 nt, AACAAG, was made between bases 
226 and 227 of pseudoknot loop 1 (Fig. 1). Alb391 had a 
markedly defective phenotype, exhibiting slow growth and 
forming much smaller plaques than did wild-type virus at 33°C, 
37°C, and 39°C. Revertants of this mutant were rapidly se¬ 
lected in as few as one or two passages. Six independent re¬ 
vertants (Revl to Rev6), which formed plaques identical to or 
slightly smaller than those of the wild type, were isolated and 
analyzed by partial or total genomic sequencing (Table 1). One 
of these, Revl, had lost the first five bases of the insertion, 
retaining a footprint of only the final G residue. This finding 
constituted strong evidence that the 6-nt insertion, rather than 
some extraneous mutation, was indeed responsible for the 
Alb391 phenotype. 

All of the other revertants retained the complete 6-nt inser¬ 
tion, indicating that second-site suppressor mutations had to 
account for their observed reversion. For one of these, Rev2, 
we determined the sequence of the entire viral genome and 
compared it to the particular database reference sequence 
(GenBank accession number NC001846) that is closest to the 
laboratory strain from which Alb391 had been derived. A total 
of nine mutations, two synonymous and seven nonsynonymous, 
were found relative to the reference sequence. However, all 
except two of these base changes also occurred in the corre¬ 
sponding wild-type virus or in the immediate fMHV precursor 
to Alb391, and they were therefore not associated with rever¬ 
sion. The two remaining mutations produced a pair of closely 


TABLE 1. Sequence analysis of initial set of candidate reverting 

mutations of Alb391 a 


Mutant 

Gene or 
region 

Nucleotide change(s) fo 

Amino acid 
ehange(s) c 

Revl 

3' UTR 

Deletion of nt 1 through 5 of 
the 6-nt insertion between 
bases 226 and 227 

None 

Rev2 

nsp8 

G12439A, C12452U 

D62N, T66I 

Rev3 

nsp9 

U13017G 

N60K 

Rev4 

nsp8 

A12344G 

Q30R 

Rev5 

3' UTR 

A2G 

None 

Rev6 

3' UTR 

A2G 

None 


a For Rev2, Rev5, and Rev6, the entire genomes were sequenced. For Revl, 
Rev3, and Rev4, the genomic regions sequenced were as follows: the 5'-most 510 
bases, the segment of gene la encoding nsp7 through nsplO, and the entire 
3' UTR. 

b Nucleotide numbering is from the 5' end of the genome for replicase (gene 
1) mutations and from the 3' end of the genome, excluding poly(A), for 3' UTR 
mutations. All mutants, except Revl, retained the 6-nt insertion (AACAAG) 
between bases 226 and 227 of the 3' UTR. 

c Amino acid numbering indicates the residue position in each processed 
replicase protein product. 


spaced coding changes in nsp8 (Table 1), which is one of a 
group of four small processed polypeptide products encoded 
by the downstream-most end of replicase gene la. Similarly, 
sequencing of the nsp7-through-nspl0 region of both Rev3 and 
Rev4 revealed single coding mutations in nsp9 and in nsp8, 
respectively (Table 1). Notably, both nsp8 and nsp9 have been 
described as RNA-binding proteins, and nsp8 has been shown 
to possess an RNA-dependent RNA polymerase (RdRp) ac¬ 
tivity (16, 30, 56, 66). By contrast, Rev5 and Rev6 were found 
to have no mutations in the nsp7-through-nspl0 region, but 
each had the same A-to-G transition at the penultimate nu¬ 
cleotide preceding the poly(A) tail (denoted A2G in the num¬ 
bering system used for the 3' UTR) (Table 1). Initially we did 
not consider this mutation to be significant, since A2G is dis¬ 
tant from the pseudoknot and since the same base change 
occurs naturally in MHV strain 3 (47). Accordingly, we se¬ 
quenced the entire genomes of Rev5 and Rev6 and found no 
other base changes. Thus, our initial revertant analysis de¬ 
tected three classes of candidate reverting mutations for the 3' 
UTR pseudoknot insertion in Alb391: one mapping to the 
original site of the insertion, a second falling within the repli¬ 
case gene, and the last near the 3' genomic terminus. 

Reconstruction of AIb391 by full-length cDNA cloning. To 
test whether replicase gene mutations, mapping some 20 kb 
distant from the original lesion, were responsible for reversion 
of the Alb391 mutation, we aimed to reconstruct a subset of 
the revertants. To accomplish this, we decided to use the vac¬ 
cinia virus-based complete reverse genetics system for MHV 
(11, 58), which, unlike targeted RNA recombination (43), al¬ 
lows access to the huge coronavirus replicase gene. However, 
we first needed to clarify whether the Alb391-specific 6-nt 
insertion would confer the same phenotype in the genetic 
background of the recombinant MHV-inf-1 molecular clone. 
Therefore, we generated the 6-nt pseudoknot insertion muta¬ 
tion in the MHV-inf-1 genome and analyzed the rescued mu¬ 
tant, designated rAlb391. In accord with previous results, the 
reengineered mutant was found to be extremely unstable, even 
at the earliest stages of recovery. As shown in Fig. 2A, rAlb391 
virus obtained from N protein-expressing cells that had been 
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FIG. 3. Growth kinetics for small-plaque isolates SI to S3 of rAlb391, compared with a wild-type control. Cultures were inoculated with 
passage-2 virus (A) or passage-6 virus (B) at a multiplicity of 1.0 PFU per cell. Data shown are a set from one representative of duplicate 
experiments, which gave highly similar results. 


transfected with synthetic mutant genomic RNA had a mixture 
of mutant and wild-type sequence at the position of the inser¬ 
tion (and consequently at all points downstream in a positive- 
strand sequence). Plaques of rAlb391 exhibited considerable 
heterogeneity, ranging from very small to wild-type size (data 
not shown). To analyze this in more detail, we picked 3 of the 
smallest plaques (SI through S3) and 2 of the largest plaques 
(LI and L2) and propagated them for 6 and 10 passages, 
respectively. Isolates LI and L2 formed uniformly large 
plaques at passages 2 and 10 (Fig. 2B), and sequence analysis 
revealed that they had exactly lost the 6-nt insertion (data not 
shown); this identified them as precise primary-site revertants. 

By contrast, isolates SI through S3 had a variety of plaque 
sizes by passage 2. By passage 6, they formed fairly uniform 
plaques that were the same size, or nearly the same size, as 
those of the wild type (Fig. 2B). Sequencing showed that this 
stability had been achieved through the acquisition of one of a 
number of new mutations. In the stock of isolate SI, the 6-nt 3' 
UTR pseudoknot insertion was retained. However, by passage 
6, a previously unseen nsp9 mutation had been selected 
(A13010G), resulting in the amino acid change D58G (Fig. 
2C). For isolate S2, the passage-6 stock was composed of a new 
type of primary site footprint mutant, in which the last three 
bases of the 6-nt insertion (AAG), as well as the next down¬ 
stream base (U), were deleted (Fig. 2D). There were no addi¬ 
tional changes in either the nsp7-through-nspl0 region or the 
remainder of the 3' UTR of S2 passage 6. Finally, bulk se¬ 
quencing of the passage-6 stock of isolate S3 gave a mixture of 
sequences, which were resolved through TA cloning of RT- 
PCR products from the 3' UTR. Five out of nine TA clones 


were found to have the same 3-nt footprint of the original 
insertion that had been found in the S2 isolate by passage 6 
(Fig. 2E, left), with no other mutations in the 3' UTR. On the 
other hand, the remaining four TA clones showed an intact 
6-nt insertion, accompanied by the same A2G mutation that 
had been found in Rev5 and Rev6 (Fig. 2E, right). The S3 
passage-6 stock had no additional changes in the nsp7- 
through-nsplO region of the genome. In accord with the ap¬ 
parently strong selective pressure against the original Alb391 
mutation, the growth of passage-2 stocks of the SI through S3 
isolates was greatly impaired relative to that of the wild type 
for at least the first 11 h of infection (Fig. 3A). Later, by 26 h 
postinfection, SI through S3 mutant titers appeared to ap¬ 
proach that of the wild type, but this may have reflected the 
emergence and overrunning of the cultures by revertants. By 
passage 6, the small-plaque isolates replicated with kinetics 
comparable to that of the wild type (Fig. 3B). As expected, the 
growth kinetics for either the passage-2 or passage-10 stocks of 
the LI and L2 isolates were indistinguishable from the kinetics 
for the wild type (data not shown). 

Taken together, these results showed that the AACAAG 
insertion mutation in pseudoknot loop 1 conferred the same 
phenotype in the MHV-inf-1 genetic background as had been 
seen in the initial Alb391 mutant (18). Furthermore, the anal¬ 
yses of the Alb391 and rAlb391 revertants revealed the same 
three classes of compensatory mutations, as summarized in 
Fig. 4A. First, there were primary-site revertants, in which 
there occurred loss of the entirety or a portion of the 
AACAAG insertion, as in rAlb391 isolates LI and L2 or in the 
two types of footprint mutations in Revl and rAlb391 isolates 


FIG. 2. Reconstruction of the Alb391 3' UTR insertion mutant by using the vaccinia virus-based reverse-genetics system for MHV. (A) Se¬ 
quence of RT-PCR product from the 3' UTR of the initially recovered mutant, rAlb391. The position of the AACAAG insertion is denoted, and 
mixed sequence downstream of the site of the insertion is boxed. (B) Plaques from passages 2 and 6 of three small-plaque isolates of rAlb391 (SI 
to S3), compared with plaques from passages 2 and 10 of two large-plaque isolates of rAlb391 (LI and L2) and a wild-type control. (C) Sequences 
of RT-PCR products from the pseudoknot region (3' PK) of the 3' UTR and nsp9 from passages 2 and 6 of the SI isolate; note that the nsp9 
sequence is the negative sense. (D) Sequences of RT-PCR products from the pseudoknot region of the 3' UTR from passages 2 and 6 of the S2 
isolate. (E) Sequences of the pseudoknot region and the downstream terminus (3' end) of the 3' UTR of representative TA clones from passage 
6 of the S3 isolate. In C to E, the insertion or its remnant is boxed, and mutant nucleotides or their wild-type counterparts are circled. 
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FIG. 4. Genetic cross talk between loop 1 of the 3' UTR pseudoknot and both gene la and the 3' end of the MHV genome. (A) Summary of 
all types of reverting mutations from the original Alb391 mutant and the reconstructed mutant rAlb391. In the expansion of the region of the 
genome encoding nsp7 to nsplO, main protease cleavage sites (8, 52) are denoted by open arrowheads and the flanking amino acid residues are 
indicated; alternate names for the MHV nsp7-to-nspl0 polypeptides are given in italics. In the expansions of regions of the 3' UTR, nucleotide 
numbering is from the first base at the 3' end of the genome, excluding poly(A). The original 6-nt insertion and remnants thereof are boxed; 
mutated nucleotides are shown in bold. For each type of reverting mutation, the designations of independently obtained isolates are given in 
parentheses. (B) Loci of the reverting mutations in nsp8. The schematic of the secondary structure of nsp8 of SARS-CoV corresponds to the crystal 
structure reported by Zhai and coworkers (66). (C) Loci of the reverting mutations in nsp9. The schematic of the secondary structure of nsp9 of 
SARS-CoV corresponds to the crystal structure reported by Eglolf and coworkers (16), but it is also substantially the same as that reported by 
Sutton and coworkers (56). Beneath the schematics in panels B and C are alignments for portions of nsp8 and nsp9 of coronaviruses spanning all 
three phylogenetic groups. GenBank accession numbers for the sequences shown are as follows: MHVA, MHV strain A59, AY700211; BCVM, 
bovine coronavirus strain Mebus, U00735; HKU1, human coronavirus strain HKU1, AY597011; SARS, SARS coronavirus strain Urbani, 
AY278741; IBVB, infectious bronchitis virus strain Beaudette, M94356; 229E, human coronavirus strain 229E, AF304460; TGEV, transmissible 
gastroenteritis virus strain Purdue, AJ271965; NL63, human coronavirus strain NL63, AY567487; and PEDV, porcine epidemic diarrhea virus 
strain CV777, AF353511. MHV mutant Alb391 reverting mutations are boxed above each alignment. Filled circles mark residues that are identical 
among at least seven of the nine sequences; open circles mark additional residues that are identical among all group 2 coronaviruses. Amino acid 
numbering indicates the residue position in each processed replicase protein product. 
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S2 and S3. Second, four independent reverting mutations, 
Rev2 through Rev4 and rAlb391 isolate SI, mapped to three 
loci in nsp8 and nsp9, altering moderately conserved residues 
in these RNA-binding proteins. Finally, three independent re- 
vertants, Rev5, Rev6, and rAlb391 isolate S3, each had a single 
point mutation, changing the second residue of the 3' UTR 
from A to G. 

Interactions between the 3' UTR pseudoknot and replicase 
gene products. Having shown that Alb391 and the recon¬ 
structed rAlb391 mutants displayed the same phenotype, we 
next examined the effects of reconstruction of two of the gene 
la candidate reverting mutations in both the presence and the 
absence of the 3' UTR pseudoknot AACAAG insertion. For 
this purpose, we chose the pair of nsp8 mutations from Rev2 
(G12439A and C12452U) and the single nsp9 mutation from 
Rev3 (U13017G) (Table 1). We analyzed three independent 
isolates bearing the nsp8 mutations in the presence of the 3' 
UTR insertion (mutant rN8-Alb391) and two isolates bearing 
the nsp8 mutations alone (mutant rN8). Likewise, two isolates 
each were analyzed for the nsp9 mutation in the presence 
(mutant rN9-Alb391) or the absence (mutant rN9) of the 3' 
UTR insertion (Fig. 5A). The rN8-Alb391 and rN9-Alb391 
reconstructed revertants produced plaques that were only 
slightly smaller than those of the wild type (Fig. 5B), demon¬ 
strating that the nsp8 and nsp9 mutations were indeed respon¬ 
sible for the suppression of the Alb391 phenotype in Rev2 and 
Rev3. In general, the rN8-Alb391 and rN9-Alb391 mutants 
were stable for up to 10 passages: the nsp8 or nsp9 mutation 
remained unchanged, and the 6-nt insertion in the 3' UTR was 
retained. However, an exception to this trend was that for 2 of 
the rN8-Alb391 isolates, the 6th residue of the 3' UTR inser¬ 
tion (G) was gradually replaced by A over the course of 10 
passages. An example of this replacement is shown in Fig. 5C. 
This suggested that the nsp8 mutations could not optimally 
compensate for the distortion of the 3' UTR pseudoknot and 
that there was some residual selective pressure to mitigate the 
effect of the AACAAG insertion. The suppression of the 
Alb391 phenotype was also evident in the growth kinetics for 
the rN8-Alb391 and rN9-Alb391 mutants, which closely resem¬ 
bled the kinetics for the wild type at both passage 2 and pas¬ 
sage 10 (Fig. 5D and E). At early times postinfection, both 
mutants lagged slightly behind the wild type, but they were far 
more robust than the passage-2 small-plaque isolates of 
rAlb391 (Fig. 3). 

Somewhat differently, plaques of the rN8 and rN9 mutants 
were of fully wild-type size, indicating that the suppressor mu¬ 
tations, in isolation, were not noticeably deleterious to the 
virus (Fig. 5B). Sequence analysis confirmed this inference. 
The nsp8 and nsp9 mutations were stable over 10 passages, and 
their presence did not select for any compensating changes 
in the 3' UTRs of these viruses. In addition, the growth kinetics 
for the rN8 and rN9 mutants were highly similar to that for the 
wild type (data not shown). 

Collectively, these data demonstrated that the mutated rep¬ 
licase gene products nsp8 and nsp9 are still functional in the 
context of both the wild-type 3' UTR and the Alb391 3' UTR. 
More importantly, it was directly proven, by reconstruction of 
the Alb391 revertants Rev2 and Rev3, that the replicase gene 
mutations mapping to nsp8 and nsp9 could compensate for the 
markedly defective phenotype of Alb391, thus making it very 


likely that the other two replicase gene mutations, found in 
revertants Rev4 and rAlb391 isolate SI, were also genuine 
suppressors of the severe growth defect mediated by the 6-nt 
pseudoknot insertion. These data were indicative of an inter¬ 
action of two replicase gene products, the RNA-binding 
proteins nsp8 and nsp9, with an essential cis -acting RNA 
pseudoknot located at the MHV 3' UTR. 

Interactions between the pseudoknot and the 3' end of the 
genome. The last class of revertant, the A2G mutation, was 
very surprising, because the poly(A)-adjacent end of the ge¬ 
nome was not obviously connected with the 3' UTR 
pseudoknot in previous RNA structural analyses. To confirm 
whether A2G was truly a reverting mutation, we reconstructed 
this base change in both the presence and absence of the 6-nt 
pseudoknot insertion, using targeted RNA recombination. As 
we had observed for the replicase gene mutations, the A2G 
mutation, on its own, produced no detectable phenotype (as is 
also the case for MHV strain 3 [47]). Similarly, recombinants 
containing A2G together with the pseudoknot insertion 
formed plaques that were only slightly smaller than plaques of 
the wild type and were rescued at a much higher frequency 
than Alb391 (data not shown). This proved conclusively that 
the apparently simple A2G mutation could also suppress the 
severe impairment caused by the AACAAG pseudoknot inser¬ 
tion. 

This result also provided the first indication of the existence 
of a direct interaction between pseudoknot loop 1 and the 3' 
genomic terminus. The folding of these regions that is shown in 
Fig. 1 is derived from a study (38) in which a structure running 
from nt 166 to the 3' end of the genome was predicted with the 
mfold software (44, 68). At the outset, the chosen 5' upper 
limit of nt 166 would have excluded potential interactions with 
further-upstream portions of the 3' UTR, including the 
pseudoknot. Indeed, we found that the lowest-free-energy 
mfold predictions obtained for the distal end of the 3' UTR, 
running instead from the start of pseudoknot loop 1 (nt 230) 
through the 3' terminus, support a different RNA secondary 
structure, which is shown in Fig. 6A. In this case, the 3'-most 
29 nt of the genome are seen to form two helices with further 
upstream regions, one with pseudoknot loop 1 and one with 
the segment immediately distal to pseudoknot stem 2. The 
likelihood of this alternative structure is further supported by 
its absolute phylogenetic conservation among all other known 
group 2 coronaviruses, including the closely related BCoV 
and human coronavirus OC43 as well as the more divergent 
SARS-CoV, human coronavirus HKU1, and bat coronaviruses 
HKU4, HKU5, and HKU9 (Fig. 6B). Thus, not only do mem¬ 
bers of this coronavirus subfamily strictly maintain the 3' UTR 
bulged stem-loop and pseudoknot but they also appear to be 
required to preserve base pairing between loop 1 of the 
pseudoknot and the 3' end of the genome. Similar loop 1-3'- 
end structures potentially occur in the genomes of the group 1 
and group 3 coronaviruses, but as yet no experimental evidence 
supports their existence. 

To further test the loop 1-3'-end interaction in MHV, we 
designed a mutant harboring a large deletion within the 3' 
UTR that would retain the two putative helices of the new 
model while eliminating the entire HVR and all of the adjacent 
region that was formerly thought to base pair with the 3' end 
of the genome. We had previously constructed a deletion mu- 
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FIG. 6. Interaction between loop 1 (LI) of the 3' UTR pseudoknot (PK) and the 3' end of the genome. (A) Refinement of the complete 
structure of the MHV 3' UTR, incorporating base pairing of the 3' end of the genome with both pseudoknot loop 1 and the region downstream 
of pseudoknot stem 2 (S2); other abbreviations are the same as those used in Fig. 1. Arrows indicate points at which truncations were introduced 
in the previously characterized AHVR2 deletion mutant (19) and in a new mutant, the AHVR3 mutant. (B) Phylogenetic conservation of the loop 
1-3'-end interaction among all group 2 coronaviruses. GenBank accession numbers for the sequences shown, other than those given in the legend 
to Fig. 4, are as follows: HCoV-OC43, human coronavirus strain OC43, AF523847; and bat coronavirus (BatCoV) strains HKU4, HKU5, and 
HKU9, respectively, EF065505, EF065509, and EF065513. (C) Detailed view of the complete 164-nt 3' UTR of the AHVR3 mutant. Broken lines 
represent alternative base pairings either for the bottom stem segment of the bulged stem-loop or for pseudoknot stem 1. Nucleotides of the 
tetraloop introduced between positions 29 and 171 are circled. Nucleotide numbering is from the first base at the 3' end of the genome, excluding 
poly(A), and the N-gene stop codon is boxed. 


tant, the AHVR2 mutant (Fig. 6A), in which 109 nt of the 3' 
UTR were excised with little consequence to the in vitro 
growth of the virus (19). In the new mutant, designated the 
AHVR3 mutant, we created an even more extensive deletion, 
of 141 nucleotides, in which all sequence between bases 29 and 
171 was replaced by a tetraloop capping the helix downstream 
of pseudoknot stem 2 (Fig. 6C). The AHVR3 mutant was not 


only viable but formed plaques that were 80% of the size of 
wild-type plaques and were larger than those of the previously 
characterized AHVR2 virus. Moreover, the AHVR3 mutant 
exhibited growth kinetics comparable to those of the wild type 
in single-step, high-multiplicity infections (Fig. 7A) and in mul¬ 
ticycle, low-multiplicity infections (Fig. 7B). Under both con¬ 
ditions, growth of the AHVR3 mutant lagged behind that of 


FIG. 5. Reconstruction of viruses harboring candidate gene la reverting mutations of Alb391. (A) Recombinants rN8-Alb391 and rN9-Alb391, 
respectively, contain the nsp8 and nsp9 mutations from revertants Rev2 and Rev3 in the presence of the original Alb391 3' UTR insertion 
mutation. Recombinants rN8 and rN9, respectively, contain the Rev2 and Rev3 mutations in the presence of the wild-type 3' UTR. (B) Plaques 
from passages 2 and 10 of rN8-Alb391, rN9-Alb391, rN8, and rN9, compared with a wild-type control; the wild-type control plaques are the same 
as those shown in Fig. 2. (C) Sequences of RT-PCR products from the pseudoknot region (3' PK) of the 3' UTR from passages 2, 7, and 10 of 
one rN8-Alb391 isolate. The variable base of the AACAAG insertion is denoted. (D and E) Growth kinetics for reconstructed revertants 
rN8-Alb391 and rN9-Alb391, compared with a wild-type control. Cultures were inoculated with passage-2 virus (D) or passage 10 virus (E) at a 
multiplicity of 1.0 PFU per cell. Data shown for each mutant are a set from one representative of duplicate experiments, which gave highly similar 
results. Also, duplicate sets from two additional independent isolates of rN8-Alb391 were essentially identical to the data shown. 
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FIG. 7. Replicational competence of the AHVR3 mutant. Growth kinetics of the AHVR3 mutant relative to that of the wild type at a high 
multiplicity of infection (5.0 PFU per cell) (A) or a low multiplicity of infection (0.01 PFU per cell) (B). Open and solid symbols represent results 
from two independent experiments. (C) RNA synthesis by the AHVR3 mutant and the wild type. Infected or mock-infected 17C11 cells were 
metabolically labeled with [ 33 P]orthophosphate in the presence of actinomycin D, and RNA was isolated and electrophoretically separated in 1% 
agarose containing formaldehyde as described in Materials and Methods. 


the wild type by 2 h but ultimately reached the same maximal 
titers. Additionally, metabolic labeling of infected cells re¬ 
vealed no significant difference in gRNA or sgRNA synthesis 
between the mutant and the wild type (Fig. 7C). Taken to¬ 
gether, these results showed that the 3' genomic cis -acting 
elements essential for MHV RNA synthesis are restricted to 
two separate segments of the 3' UTR, nucleotides 1 through 29 
and nucleotides 171 through 301. Surprisingly, therefore, al¬ 
most half of the 3' UTR is dispensable with respect to func¬ 
tions required for growth in tissue culture. 

DISCUSSION 

All RNA viruses harbor cis -acting sequences and structures 
in their genomes that are critical for replication. Within the 
order Nidovirales , 3' cis -acting elements are currently best 
characterized for the group 2 coronaviruses, in which an over¬ 
lapping bulged stem-loop and pseudoknot in the 3' UTR have 
been found to be essential for RNA synthesis (Fig. 1) (9, 42). 
In the group 3 coronaviruses, a functionally essential stem-loop 
is located at the upstream end of the 3' UTR (12), but the 
status of different nearby candidate pseudoknots is unresolved 
(20, 60). Conversely, the group 1 coronaviruses all contain a 
very highly conserved pseudoknot, but a neighboring potential 
counterpart of the bulged stem-loop (15) is not clearly identi¬ 
fiable in every member of this group. Over a greater phyloge¬ 
netic distance, in the arteriviruses, which have much smaller 3' 
UTRs, certain required cis -acting RNA structures have been 
described that extend into the upstream N gene (5, 6, 59). It is 
not obvious at present how closely analogous to coronavirus 3' 
elements these RNA structures will turn out to be, but the 
genomes of members of the two virus families might be ex¬ 
pected to interact in similar ways with their respective repli- 
cases, which are highly homologous (21). 

Interactions between the pseudoknot and the 3' end of the 
genome. In the work presented here, we analyzed revertants of 
a highly defective MHV mutant containing a particular inser¬ 
tion, AACAAG, in loop 1 of the 3' UTR pseudoknot (sum¬ 


marized in Fig. 4A). Our finding of six independent primary- 
site reverting mutations that deleted or altered part or all of 
the insertion confirmed that the insertion was, in actuality, the 
basis of the deleterious and unstable genotype of the original 
mutant. More informative, however, were two groups of sec¬ 
ond-site reverting mutations that were obtained in both the 
original mutant, Alb391, which had been constructed by tar¬ 
geted RNA recombination, and rAlb391, which was recon¬ 
structed by vaccinia virus-based reverse genetics. These two 
classes of suppressors mapped to distinct regions of the ge¬ 
nome, thereby revealing two types of previously unsuspected 
interactions. One class, independently isolated three times, was 
represented by a single type of revertant, in which the second 
3' UTR residue upstream of the poly(A) tail was changed from 
an A to a G. That this single base change could rescue the 
Alb391 insertion mutant suggested that the 3' end of the ge¬ 
nome must somehow communicate with loop 1 of the 
pseudoknot. This conclusion led us to identify a longer-range 
secondary structure in which the 3' end of the genome folds 
into two helices through base pairing with pseudoknot loop 1 
and with the region downstream of pseudoknot stem 2 (Fig. 6). 
Additional evidence in support of this interaction came from 
the following: (i) mfold -based structural predictions; (ii) the 
total conservation of the interaction among all known group 2 
coronaviruses (and possibly also among group 1 and group 3 
coronaviruses); (iii) our construction of a mutant, the AHVR3 
mutant, which preserves this base pairing in a drastically min¬ 
imized version of the 3' UTR; and (iv) our previous finding of 
the lethality of a more extensive deletion in the 5'-ward direc¬ 
tion, AHVR1, which suggested that at least some subset of 
nucleotides 157 through 184 must be essential (19). 

It is noteworthy that in our redefined folding of the 3' end of 
the genome, base A2 does not participate in a Watson-Crick 
base pair (Fig. 6). Thus, it is not immediately obvious how the 
A2G mutation relieves the problem caused by the AACAAG 
insertion. We favor the explanation that the insertion in loop 1 
forces an alternative folding of the pseudoknot-3' end interac- 
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tion and that the A2G mutation destabilizes this alternative 
structure, thereby allowing the wild-type fold to predominate. 
This notion is supported by the fact that the A2G mutation, in 
the context of an otherwise wild-type 3' UTR, has no detect¬ 
able effect on the viral phenotype. A potentially related obser¬ 
vation is that the mutation A2U, which is also neutral by itself, 
is lethal in combination with the AACAAG insertion (data not 
shown). Further mutational analysis will be needed to more 
completely define the sequence and structural requirements 
for the interaction between the extreme 3' end of the genome 
and pseudoknot loop 1, as well as the region downstream of 
pseudoknot stem 2. 

The viability and nearly normal phenotype of the AHVR3 
mutant indicate that the minimal 3' cis -acting element re¬ 
quired for MHV RNA synthesis is limited to no more than 160 
nt of the 3' UTR. This means that nearly half of the 3' UTR, 
bases 30 through 170, can be discarded without affecting the 
basic mechanism of viral RNA synthesis. The deleted region in 
the AHVR3 mutant encompasses in vitro binding sites for a 
number of host cell proteins, including PTB (27), hnRNP A1 
(28), and a complex of mitochondrial aconitase and the chap¬ 
erones mtHSP70, HSP60, and HSP40 (45, 46, 64, 65). It fol¬ 
lows that these binding sites cannot be strictly required for 
RNA synthesis, although they may play ancillary roles yet to be 
defined. Additionally, the region of the 3' UTR that was elim¬ 
inated in the AHVR3 mutant encompasses some RNA seg¬ 
ments that were previously proposed to base pair with the 3' 
end of the genome (38) (Fig. 1). Our redefined folding of the 
3' UTR is also at variance with an alternative ra/b/d-predicted 
structure proposed for solely the 3'-terminal 42 nt of the ge¬ 
nome (31). Notably, our results prompt reconsideration of the 
conclusions reached by a previous study, which mapped the 
promoter for negative-strand RNA synthesis based on RNase 
protection assays specific for the negative strand of a heterol¬ 
ogous RNA that was connected to progressively decreasing 
lengths of the MHV 3' UTR (37). That work concluded that 
the promoter for negative-strand initiation constitutes the last 
45 to 55 nt of the genome. However, it is now clear that the 
essential sequence at the 3' end of the genome is no more than 
29 nt, together with a poly(A) tail of at least 5 to 10 residues 
(54). Moreover, we suggest that nt 1 to 29, on its own, is not 
sufficient for the initiation of negative-strand RNA synthesis, 
but it likely acts in association with the pseudoknot. Such an 
association would normally occur in cis, but in the promoter 
mapping experiments (37), it must have occurred in trans, 
between the heterologous RNA and the helper virus genome. 

Interactions between the pseudoknot and gene la replicase 
products. The second class of suppressor mutations found in 
our genetic analysis was represented by four independently 
isolated revertants that mapped to nsp8 or nsp9. These repli¬ 
case components fall within a group of small proteins, nsp7 
through nsplO (each 10 to 22 kDa), that are encoded at the 
downstream end of gene la (Fig. 4A) (8, 52). All four are 
processed from the replicase polyproteins la and lab by the 
main viral proteinase (nsp5), and they localize in discrete peri¬ 
nuclear and cytoplasmic foci in infected cells as parts of the 
membrane-associated replication complex (8, 10, 14, 22, 53). 

Structural studies of each of the highly conserved SARS- 
CoV homologs of nsp7 through nsplO have recently been re¬ 
ported. SARS-CoV nsp9, the crystal structure of which was 


separately solved by two groups, has been shown to have non¬ 
specific RNA-binding activity and a fold not previously seen in 
any RNA-binding protein (16, 56). Nsp9 crystallizes in two 
dimeric forms, and biophysical evidence has also been pre¬ 
sented for the existence of an interaction between nsp9 and 
nsp8 (56). A hexadecameric complex of SARS-CoV nsp8 and 
nsp7 has been found to bind to double-stranded RNA. The 
crystal structure of this complex (66), partially confirmed by 
the nuclear magnetic resonance solution structure of nsp7 (48), 
is a hollow cylinder potentially able to encircle double- 
stranded nucleic acid. Within this ring, which resembles a pro- 
cessivity clamp, nsp8 monomers are folded in two different 
conformations. Finally, there are two independently solved 
structures of SARS-CoV nsplO, which reveal that it represents 
a novel class of zinc finger protein which assembles into a 
dodecameric complex having a weak affinity for single- and 
double-stranded RNA (32, 55). A classical MHV mutant, 
AFA6, which has the phenotype of shutoff of negative-strand 
RNA synthesis at the nonpermissive temperature (3, 51), has 
been mapped to nsplO (50), and reverse-genetics analysis of 
MHV nsplO has also indicated that this protein plays an es¬ 
sential role in viral RNA synthesis (14). 

In the absence of further information, such as would be 
provided by RNA cocrystals, we cannot yet use the abundance 
of available structural data to draw strong conclusions about 
the nature of the reverting mutations that suppress the Alb391 
pseudoknot insertion mutation. Each of the four nsp8 or nsp9 
reverting mutations introduces a more basic charge into its 
respective molecule (Fig. 4B and C). The exact loci of the nsp8 
mutations Q30R and D62N-T66I suggest that the mutations 
are most likely to affect the packing between monomers at the 
interface of the two nsp8 molecules in different conformations 
in the asymmetric unit of the nsp7-nsp8 hexadecamer. Fike- 
wise, the nsp9 mutations D58G and N60K may affect intersub¬ 
unit beta-sheet interactions that stabilize one type of dimer 
formation (56). In both cases, these alterations could indirectly 
influence the RNA-binding specificity or affinity of nsp8 or 
nsp9. Alternatively, the conformations of the nsp’s in a 
polyprotein precursor or in an RNA-bound form may differ 
from that revealed in the crystal structure, and thus, the in¬ 
creased positive charge caused by the reverting mutations 
might then directly enhance the affinity of nsp8 or nsp9. We are 
currently seeking in vitro biochemical data to corroborate 
these ideas. 

Very surprisingly, purified SARS-CoV nsp8 has recently 
been found to possess RdRp activity, and it has been proposed 
to function as a primase for the canonical coronavirus RdRp, 
nspl2 (30). The in vitro enzymatic activity of SARS-CoV nsp8 
has a dependence on manganese ion, and it exhibits a strong 
preference for cytidine-rich RNA templates, synthesizing small 
oligoribonucleotides of six or fewer residues. We have con¬ 
firmed these same properties for the MHV homolog of nsp8 
(S. J. Goebel and P. S. Masters, unpublished data). Imbert and 
coworkers (30), who discovered the RdRp activity of nsp8, 
have suggested that nsp8 could act at multiple points through¬ 
out the genomic template, although it must be noted that the 
characteristics of this enzyme may be substantially altered in 
the company of nsp7, nsp9, nsplO, and the authentic viral RNA 
substrate. In view of our suppressor results, we favor the idea 
that nsp8 acts as a primase during the initiation of negative- 
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FIG. 8. Hypothetical model of initiation of coronavirus negative-strand RNA synthesis. (A) At the outset, formation of the pseudoknot is 
precluded by (i) closure of the bottom segment of the bulged stem-loop (BSL) and (ii) the interaction between pseudoknot (PK) loop 1 (LI) and 
the 3' end of the genome. (B) The nsp8 primase, in conjunction with nsp7, nsp9, and nsplO, binds to the stem formed by loop 1 and the 3' end. 
(C) Synthesis of the 5' end of negative-strand RNA by the nsp8 primase separates the 3' end of the genome from loop 1, thereby allowing the 
pseudoknot to form (D). The nspl2 RdRp then binds to the pseudoknot, together with nspl3 (helicase), nspl4 (ExoN nuclease), nspl5 (NendoU 
nuclease), and nspl6 (methyltransferase). (E) The nspl2 complex carries out elongation of the negative-strand primer. Not shown are possible 
roles played by other nsp’s, the poly(A) binding protein, and the viral N protein. 


strand synthesis. The interactions of the MHV 3' pseudoknot 
loop 1 with the extreme 3' genomic terminus and with nsp8 and 
nsp9 would position a template RNA secondary structure and 
particular replicase components very close to the start site of 
negative-strand RNA synthesis, which initiates with production 
of a tract of oligo(U) some 9 to 26 residues into the poly(A) tail 
(25). It is thus conceivable that the primase activity of nsp8 is 
involved in the initiation of coronavirus negative-strand RNA 
synthesis before the canonical RdRp activity, encoded by the 
nspl2 gene, can resume negative-strand RNA synthesis by 
elongating the nsp8-derived primer molecules. This idea is 
supported by the fact that nspl2 can be targeted to replication 
complexes when expressed in trans as a fusion protein with 
green fluorescent protein in MHV-infected cells; moreover, 
nspl2 can be coimmunoprecipitated with nsp8 and nsp9 from 
infected cell lysates (10). 

Model for initiation of MHV negative-strand RNA synthe¬ 
sis. Our suppressor results, integrated with the finding of pri¬ 
mase activity of nsp8, thus lead us to propose a model for 
initiation of MHV negative-strand RNA synthesis, which is 
shown in Fig. 8. We speculate that the most stable conforma¬ 
tion of the 3' UTR RNA is one in which the bulged stem-loop 
is fully folded and the 3' end of the genome is annealed to 
pseudoknot loop 1. These two folding alternatives would pre¬ 
clude formation of stem 1 of the pseudoknot. The stem formed 


by loop 1 and the 3' end of the genome is postulated to serve 
as the site of assembly of a complex that includes at least nsp8 
and nsp9 and probably also nsp7 and nsplO. These replicase 
constituents may act as processed monomers, as shown. Alter¬ 
natively, they may be part of a much larger nsp4-nspll pre¬ 
cursor, which has been suggested to be the relevant species for 
negative-strand RNA synthesis (50). Thus, the MHV 3' 
pseudoknot in conjunction with the 3' end of the genome may 
constitute a cis -acting element that allows the specific recog¬ 
nition of the gRNA as the template for initiation of minus- 
strand RNA synthesis. Following substrate recognition, initia¬ 
tion of negative-strand RNA synthesis by the nsp8 primase 
complex leads to separation of the 3' end of the genome 
from loop 1, thereby allowing formation of stem 1 of the 
pseudoknot. Finally, the completely folded pseudoknot, in con¬ 
junction with nsp8 and nsp9, then recruits the nspl2 RdRp and 
the associated activities contained in nspl3 through nspl6, and 
the resulting complex resumes the elongation of negative- 
strand RNA. 

We are aware that many mechanistic details of this process 
remain to be elucidated. The proposed role of the MHV 
pseudoknot in association with the 3' genomic end bears some 
resemblance to the function of the 5' cis -acting element re¬ 
quired for the assembly and positioning of the poliovirus 
preinitiation RNA synthesis complex discussed above (4, 23). 
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However, it remains to be established whether cis -acting ele¬ 
ments located at the 5' end of the coronavirus genome also 
participate in this process. Although the notion of genome 
circularization has been previously invoked (29, 69), there is 
currently no clear evidence that coronavirus 5' and 3' genomic 
ends are bridged by viral or cellular proteins or by heretofore- 
unidentihed RNA-RNA interactions. Congruent with the anal¬ 
ogy to the poliovirus system, however, it has been demon¬ 
strated that the length of the poly(A) tail of coronavirus DI 
RNA affects the efficiency of replication (54). Finally, the exact 
composition of the coronavirus preinitiation complex remains 
to be determined; the complex conceivably assembles cotrans- 
lationally and involves more nsp’s than are pictured in our 
model. We hope that our discovery of an interaction between 
an essential 3' cis -acting pseudoknot, specific replicase gene 
products, and the extreme 3' end of the MHV genome will 
provide a starting point for further genetic, structural, and 
biochemical analyses aimed at establishing a more detailed 
understanding of the earliest events of coronavirus RNA syn¬ 
thesis. 
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